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SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Schulein, Martin 

6 Andersen, Lene N. 

7 Lassen, Soren F. 

8 Kauppinen, Markus S. 

9 Lange, Lene 

10 Nielsen, Ruby I. 

11 Ihara, Michiko 

12 Takagi, Shinobu 
14 (ii) TITLE OF INVENTION: Novel Endoglucanases 
16 (iii) NUMBER OF SEQUENCES: 109 

18 (iv) CORRESPONDENCE ADDRESS: 

19 (A) ADDRESSEE: Novo Nordisk of North America, Inc. 

20 (B) STREET: 405 Lexington Avenue, 64th Floor 

21 (C) CITY: New York 

22 (D) STATE: New York 

23 (E) COUNTRY: United States of America 

24 (F) ZIP: 10174-6401 

26 (v) COMPUTER READABLE FORM: 

27 (A) MEDIUM TYPE: Floppy disk 

28 (B) COMPUTER: IBM PC compatible 

29 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

30 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
32 (vi) CURRENT APPLICATION DATA: 

c „> 33 (A) APPLICATION NUMBER: US/10/007 r 521 

C--> 34 (B) FILING DATE: 10-Dec-2001 

35 (C) CLASSIFICATION: 

38 (vii) PRIOR APPLICATION DATA: 

39 (A) APPLICATION NUMBER: US/08/651,136 

40 (B) FILING DATE: 21-MAY-1996 

43 (viii) ATTORNEY/AGENT INFORMATION: 

44 (A) NAME: Lambiris, Elias J. 

45 (B) REGISTRATION NUMBER: 3 3,728 

46 (C) REFERENCE/DOCKET NUMBER: 4366. 200- US 

48 (ix) TELECOMMUNICATION INFORMATION: 

49 ^ (A) TELEPHONE: 212-867-0123 

50 (B) TELEFAX: 212-878-9655 
53 (2) INFORMATION FOR SEQ ID NO : 1: 

55 (i) SEQUENCE CHARACTERISTICS: 

56 (A) LENGTH: 960 base pairs 

57 (B) TYPE: nucleic acid 

58 (C) STRANDEDNESS : single 
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59 






(D) TOPOLOGY: 


linear 




















61 




(ii) 


MOLECULE TYPE: 


cDNA 




















63 




(ix) 


FEATURE : 


























64 






(A) NAME/KEY : 


CDS 






















65 






(B) LOCATION: 


113 . 


.787 




















67 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 1: 














69 


AAAGAAAGGC TCTCTGCTGT CGTCGCTCTC 


GTCGCTCTCG 


TCGGCATCCT CCATCCGTCC 


fin 


71 


GCCTTTGATA ACCCGCTCCC CGACTCAGTC 


AAGACGACGC 


ATACTTGGCA CC ATG 


115 


72 






























Met 




73 
































1 




75 


CAT 


CTC 


TCC 


GCC 


ACC 


ACC 


ppp 


TTC 
x x v_» 


CTC 


CCC 


PTP 


CCC 


PTP 


PTP 


CCC 

VJ V_, V_, 


PTP 


i fi ^ 


76 


His 


Leu 


Ser 


Ala 


Thr 


Thr 


\j J- y 


Phe 


ucu 


Al a 


Leu 


rivj 


v a x. 


JJC Li 


Ala 






77 








5 










10 










15 








79 


GAC 


CAG 


CTC 


TCG 


ppp 

V3 O V_, 


ATP 


CCC 


CAC 


ACC 


ACC 


ppp 


TAP 


TCC 

X VjVj 


PAP 


TGC 


TGC 




80 


Asp 


Gin 


Leu 


Ser 


Glv 


lie 


Gly 


Gin 


Thr 


Thr 


Arg 


Tvr 


frn 


Asp 


Cys 


Cys 




81 






20 










25 










30 










83 


AAG 


CCG 


AGC 


TGC 


GCC 


TGG 


CCC 


ppp 


AAG 


ppp 

VJ VJ v^. 


CCC 


TPP 


TPT 


CCC 


PTP 


PAP 


Z J Z? 


84 


Lys 


Pro 


OCX 


y o 


Ala 




IT X. \J 


Pi V 

\j x. y 


T.VQ 

j-i y o 






C 0 7~ 


Cor 
JCI 


Prn 
ri u 


Val 


Gin 








35 










40 










, 45 












87 


GCC 


TGC 


GAC 


AAG 


AAC 


CAC 


AAC 


CCC 


CTC 


AAC 


pap 




CCC 


TPP 


APP 


CGG 


•j \J / 


88 


Ala 


Cys 


Asp 


Lys 


Asn 


Asp 


Asn 


Prn 

C X. VJ 


JJCU 




Jen 
flop 


Gl V 


m v 


Cpr 


Thr 


Arg 




89 


50 










55 










60 














91 


TCC 


GGC 


TGC 


GAC 


GCG 


ppp 


CCC 


ACC 


CCC 


TAP 


atp 


TPP 


TPP 


TPP 


PAP 


APP 


J J J 


92 


Ser 


Glv 


Cys 


Asp 


Ala 


Glv 


Glv 


Ser 


Ala 


Tvt* 


Met 


Cys 


Ser 


Ser 


Gin 


Ser 




93 










70 










75 










80 






95 


CCC 


TGG 


GCC 


GTC 


AGC 


GAC 


GAG 


CTG 


TCG 


TAC 


GGC 


TGG 


GCG 


GPP 


GTC 


AAG 


a n i 

*t VJ J 


96 


Pro 




Ala 


Val 


Ser 


Asp 


Glu 


Leu 


Ser 


Tvt* 


CI v 

\j j- y 


Trn 

1 X F 


Ala 


Ala 


Val 


Lys 




97 








85 










90 










95 








99 


/-imp 


PPP 


CCC 


ACC 


TCC 




TCC 


CAC 


TCC 
X oo 


TCC 


TCC 




TPP 


TAP 


GAG 


CTG 


d SI 


100 


Leu 


Ala 


Gly 


Ser 


Ser 


Glu 


Ser 


Gin 


Trp 


Cys 


Cys 


Ala 


Cys 


Tyr 


Glu 


Leu 




101 






100 










105 










110 










103 


ACC 


TTC 


ACC 


AGC 


GGG 


CCG 


GTC 


GCG 


GGC 


AAG 


AAG 


ATG 


ATT 


GTG 


CAG 


GCG 


4QQ 

** _7 27 


104 


Thr 


Phe 


Thr 


Ser 


Gly 


Pro 


Val 


Ala 


Gly 


Lys 


Lys 


Met 


He 


Val 


Gin 


Ala 




105 




115 










120 










125 












107 


ACC 


AAC 


ACC 


GGT 


GGC 


GAC 


CTG 


GGC 


GAC 


AAC 


CAC 


TTT 


GAC 


CTG 


GCC 


ATC 


S47 

j *± / 


108 


Thr 


Asn 


Thr 


Gly 


Gly 


Asp 


Leu 


Gly 


Asp 


Asn 


His 


Phe 


Asp 


Leu 


Ala 


He 




J. VJ J 


130 










135 










140 










145 




111 


CCC 


GGT 


GGC 


GGT 


GTC 


GGT 


ATT 


TTC 


AAC 


GCC 


TGC 


ACC 


GAC 


CAG 


TAC 


GGC 


595 


112 


Pro 


Gly 


Gly 


Gly 


Val 


Gly 


He 


Phe 


Asn 


Ala 


Cys 


Thr 


Asp 


Gin 


Tyr 


Gly 




113 










150 










155 










160 






115 


GCT 


CCC 


CCG 


AAC 


GGC 


TGG 


GGC 


GAC 


CGC 


TAC 


GGC 


GGC 


ATC 


CAT 


TCC 


AAG 


643 


116 


Ala 


Pro 


Pro 


Asn 


Gly 


Trp 


Gly 


Asp 


Arg 


Tyr 


Gly 


Gly 


He 


His 


Ser 


Lys 




117 








165 










170 










175 








119 


GAA 


GAG 


TGC 


GAA 


TCC 


TTC 


CCG 


GAG 


GCC 


CTC 


AAG 


CCC 


GGC 


TGC 


AAC 


TGG 


691 


120 


Glu 


Glu 


Cys 


Glu 


Ser 


Phe 


Pro 


Glu 


Ala 


Leu 


Lys 


Pro 


Gly 


Cys 


Asn 


Trp 




121 






180 










185 










190 










123 


CGC 


TTC 


GAC 


TGG 


TTC 


CAA 


AAC 


GCC 


GAC 


AAC 


CCG 


TCG 


GTC 


ACC 


TTC 


CAG 


739 


124 


Arg 


Phe 


Asp 


Trp 


Phe 


Gin 


Asn 


Ala 


Asp 


Asn 


Pro 


Ser 


Val 


Thr 


Phe 


Gin 




125 




195 










200 










205 
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127 GAG GTG GCC TGC CCG TCG GAG CTC ACG TCC AAG AGC GGC TGC TCC CGT 787 

128 Glu Val Ala Cys Pro Ser Glu Leu Thr Ser Lys Ser Gly Cys Ser Arg 

129 210 215 220 225 

131 TAAGAGGGAA GAGAGGGGGC TGGAAGGACC GAAAGATTCA ACCTCTGCTC CTGCTGGGGA 847 
133 AGCTCGGGCG CGAGTGTGAA ACTGGTGTAA ATATTGTGGC ACACACAAGC TACTACAGTC 907 
135 CGTCTCGCCG TCCGGCTAAC TAGCCTTGCT GCGGATCTGT CCAAAAAAAA AAA 9 60 

138 (2) INFORMATION FOR SEQ ID NO : 2: 

140 (i) SEQUENCE CHARACTERISTICS: 

141 (A) LENGTH: 225 amino acids 

142 (B) TYPE: amino acid 
14 3 (D) TOPOLOGY: linear 
14 5 (ii) MOLECULE TYPE: protein 

147 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

149 Met His Leu Ser Ala Thr Thr Gly Phe Leu Ala Leu Pro Val Leu Ala 

150 1 5 10 15 

152 Leu Asp Gin Leu Ser Gly He Gly Gin Thr Thr Arg Tyr Trp Asp Cys 

153 20 25 30 

155 Cys Lys Pro Ser Cys Ala Trp Pro Gly Lys Gly Pro Ser Ser Pro Val 

156 35 40 45 

158 Gin Ala Cys Asp Lys Asn ^Asp Asn Pro Leu Asn Asp Gly Gly Ser Thr 

159 50 55 60 

161 Arg Ser Gly Cys Asp Ala Gly Gly Ser Ala Tyr Met Cys Ser Ser Gin 

162 65 ' 70 75 80 

164 Ser Pro Trp Ala Val Ser Asp Glu Leu Ser Tyr Gly Trp Ala Ala Val 

165 85 90 95 

167 Lys Leu Ala Gly Ser Ser Glu Ser Gin Trp Cys Cys Ala Cys Tyr Glu 

168 100 105 HO 

170 Leu Thr Phe Thr Ser Gly Pro Val Ala Gly Lys Lys Met He Val Gin 

171 115 120 125 

173 Ala Thr Asn Thr Gly Gly Asp Leu Gly Asp Asn His Phe Asp Leu Ala 

174 130 135 140 

176 He Pro Gly Gly Gly Val Gly He Phe Asn Ala Cys Thr Asp Gin Tyr 

177 145 ' 150 155 160 

179 Gly Ala Pro Pro Asn Gly Trp Gly Asp Arg Tyr Gly Gly He His Ser 

180 ~ 165 170 175 

182 Lys Glu Glu Cys Glu Ser Phe Pro Glu Ala Leu Lys Pro Gly Cys Asn 

183 180 185 190 

185 Trp Arg Phe Asp Trp Phe Gin Asn Ala Asp Asn Pro Ser Val Thr Phe 

186 195 2 00 2 05 

188 Gin Glu Val Ala Cys Pro Ser Glu Leu Thr Ser Lys Ser Gly Cys Ser 

189 210 215 220 

191 Arg 

192 225 

195 (2) INFORMATION FOR SEQ ID NO: 3: 

197 (i) SEQUENCE CHARACTERISTICS: 

198 (A) LENGTH: 894 base pairs 

199 (B) TYPE: nucleic acid 

200 (C) STRANDEDNESS : single 

201 (D) TOPOLOGY: linear 



file://C:\CRF3\Outhold\VsrJ007521 .htm 



2/4/02 



Page 4 of 8 



RAW SEQUENCE LISTING DATE: 02/04/2002 

PATENT APPLICATION: US/10/007 , 521 TIME: 11:36:15 



Input Set : N:\Crf3\RULE60\10007521.raw 
Output Set: N:\CRF3\0204 2002\J007521.raw 







z \J J . 






MOLECULE TYPE: 


cDNA 
























on ^ 






FEATURE : 






























o n £ 
ZU o 






(A) NAME/KEY: 


CDS 


























OAT 
ZU / 






(B) LOCATION: 


1. .891 
























on q 
zu y 




(XI ) 


SEQUENCE DESCRIPTION: SEQ ID NO: 3: 


















Oil 

Z lx 


£\L\3 


CHI 


CTC 


TCC 


GCC 


ACC 


ACC 


GGG 


TTC 


CTC 


GCC 


CTC 


CCG 


GTC 


CTG 


GCC 


48 






01 0 

z xz 




nib 




Dei 


Ala 


X 11X 


Thr 


vjxy 


Phe 


Leu 


Ala 


Leu 


Pro 


Val 


Leu 


Ala 




W- 


V 

- * 


Z 1 J 










230 










235 










24 0 










ZlD 








PTP 


tpp 


PPP 

VJVJV— 


ATC 


GGC 


CAG 


ACG 


ACC 


CCC 


TAP 
x n v_ 


X o o 


GAC 


TGC 


96 






Zx O 


Leu 


Asp 




i-JC- u 


OCl 


ni v 


X xc 


C\ v 
vjxy 


Gin 


Thr 


Thr 


Arg 


Tvr 
x y x 


Trn 


Asp 


Cys 




w- 


- > 


Z 1 / 








Od 5 










250 










255 












Z19 


inn/"* 
TGC 


A A 


ccc 


ahp 


TPP 


CCC 


TGG 


CCC 


GGC 


AAG 


GGC 


ccc 


TCC 


TfT 
x v^ x 


CCC 

V_» V, VJJ 


GTG 


144 






O O ft 
ZZU 


Cys 


Lys 


C X\\J 


Cor 

OCX 


tyo 


Al a 


Trn 
AI F 


Pro 


Glv 
vjxy 


Lys 


Glv 

oxy 


Pro 


Ser 


Ser 


Pro 


Val 




W- 


- > 


221 






ofin 

4 uu 










265 










A / V/ 














223 


CAG 




X U\- 


PAP 




AAP 


GAC 


AAC 


CCG 


CTC 


AAC 


CAC 


uVJL 


ccc 


TCP 


APP 


192 






ZZ4 


\J±li 


Ala 




Aon 
no p 


T \;c 

i_i_y O 


Aon 
Moll 


Aon 
nap 


Asn 


Pro 


Leu 


Asn 


A cn 
nop 


Gly 


oxy 


Ser 


Thr 




w- 


- > 


ZZD 




Z /D 










280 










A O J 
















zz / 




ILL 




TPP 


PAP 


CCC 




GGC 


AGC 


GCC 


TAC 


ATfi 
nl vj 


Tnr 

-L V3 V— 


TCC 


TCC 


CAG 


240 






ZZ o 


Arg 


Ser 


o±y 




A en 
no LJ 


Ala 


Ci v 
vj i y 


flu 

vj xy 


OCX 


Ala 


T VT 

xy x 


Met 


Cys 


Ser 


Ser 


Gin 




w- 


- > 


229 


O Q rt 

zy u 










295 










300 










305 








231 


AGC 


ccc 


X VJO 


CCC 


PTP 


AGC 


GAC 


GAG 


CTG 


TCG 


TAC 


CCC 


TCC 




ppp 


GTC 


288 






O T O 

Z 5 Z 


be! 


Pro 


irp 


Ala 
n x a 


Va 1 
v ax 


O o "f 


ficn 
nap 


Glu 


Leu 


Ser 


Tvt 
j.y x 


c^ v 

o x y 


xxp 


Ala 


Ala 


Val 




w- 


■v. 

- > 


23 3 










310 










315 










320 










23 D 


AAG 


CTC 


ccc 


GGC 


AGC 


TCC 


GAG 


TCG 


CAG 


TGG 


TGC 


*rcc 


CCC 


X Uv, 


TAP 


GAG 


336 






O 1 £ 
ZoD 


Lys 


Leu 


Ala 

nXa 




Car 
OCX 




C~\ 11 


Con 

OCX 


Gin 


Trn 


Cys 


Cys 


Ala 


Cys 


Tvr 

x y x 


Glu 




w- 


- > 


z3 / 








325 










330 










335 
j -j j 












z o y 


CTG 


ACC 


ttp 


Arc 


AGC 


GGG 


CCG 


GTC 


GCG 


GGC 


AAG 


AAfl 
Any 


ATH 

nlu 


ATT 
t\x X 


GTG 


CAG 


384 






o ii n 

Z ffc u 


Leu 


Thr 




J. 11X 


Ser 


Gly 


Pro 


Val 


Ala 


Glv 


Lys 


Lys 


Met 


lie 


Val 


Gin 




w- 


- > 


Z4 1 






140 










345 










350 














Z4 J 


GCG 


ACC 


AAP 


app 


GGT 


GGC 


GAC 


CTG 


GGC 


GAC 


AAC 


CAC 
LnL 


TTT 

X X X 


CAC 


PTP 


GCC 


432 






OA A 
Z ft *i 


Ala 


Thr 


no 11 


-L 11X 


p 1 V 
vj xy 


ni v 

V3X jf 


A Qn 


Leu 


uxy 


Asp 


Asn 


His 


Phe 


Asp 


Leu 


Ala 




w- 


- ? 


Z4 3 




355 










360 










365 
















Z4 / 


ATC 


CCC 


GGT 


GGC 


GGT 


GTC 


GGT 


ATT 


TTC 


AAC 


GCC 


TCC 


Arc 


GAC 


CAG 


TAC 


480 






0 >1 Q 
Z 4 o 


He 


Pro 


Gly 


Gly 


Gly 


Val 


Gly 


He 


Phe 


Asn 


Ala 


Cys 


Thr 


Asp 


Gin 


Tvr 

j. j x. 




W- 


- > 


z4 y 


370 










375 










380 










385 








z j! 


GGC 


GCT 


CCC 


CCG 


AAC 


GGC 


TGG 


GGC 


GAC 


CGC 


TAC 




ccc 


ATP 

n X 


PAT 


TPP 


528 






252 


Gly Ala 


Pro 


Pro 


Asn 


Gly 


Trp 


Gly 


Asp 


Arg 


Tyr 


Gly 


Gly 


He 


His 


Ser 




w- 


-> 


253 










390 










395 










400 










255 


AAG 


GAA 


GAG 


TGC 


GAA 


TCC 


TTC 


CCG 


GAG 


GCC 


CTC 


AAG 


CCC 


GGC 


TGC 


AAC 


576 






256 


Lys 


Glu 


Glu 


Cys 


Glu 


Ser 


Phe 


Pro 


Glu 


Ala 


Leu 


Lys 


Pro 


Gly 


Cys 


Asn 




w- 


-> 


257 








405 










410 










415 












259 


TGG 


CGC 


TTC 


GAC 


TGG 


TTC 


CAA 


AAC 


GCC 


GAC 


AAC 


CCG 


TCG 


GTC 


ACC 


TTC 


624 






260 


Trp 


Arg 


Phe 


Asp 


Trp 


Phe 


Gin 


Asn 


Ala 


Asp 


Asn 


Pro 


Ser 


Val 


Thr 


Phe 




w- 


-> 


261 






420 










425 










430 














263 


CAG 


GAG 


GTG 


GCC 


TGC 


CCG 


TCG 


GAG 


CTC 


ACG 


TCC 


AAG 


AGC 


GGC 


TGC 


-TCC 


672 






264 


Gin 


Glu 


Val 


Ala 


Cys 


Pro 


Ser 


Glu 


Leu 


Thr 


Ser 


L Y S 


Ser 


Gly 


Cys 


Ser 




w- 


-> 


265 




435 










440 










445 
















267 


CGT 


CCC 


TCC 


AGC 


AGC 


ACC 


AGC 


TCT 


CCG 


GTC 


AAC 


CAG 


CCT 


ACC 


AGC 


ACC 


720 






268 


Arg 


Pro 


Ser 


Ser 


Ser 


Thr 


Ser 


Ser 


Pro 


Val 


Asn 


Gin 


Pro 


Thr 


Ser 


Thr 
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455 460 465 

W " _> 271 AGC ACC ACG TCC ACC TCC ACC ACC TCG AGC CCG CCA GTC CAG CCT ACG 768 
272 Ser Thr Thr Ser Thr Ser Thr Thr Ser Ser Pro Pro Val Gin Pro Thr 

470 475 480 

W " _> 275 ACT CCC AGC GGC TGC ACT GCT GAG AGG TGG GCT CAG TGC GGC GGC AAT 816 
276 Thr Pro Ser Gly Cys Thr Ala Glu Arg Trp Ala Gin Cys Gly Gly Asn 

W " _> 279 GGC TGG AGC GGC TGC ACC ACC TGC GTC GCT GGC AGC ACT TGC ACG AAG 864 

280 Gly Trp Ser Gly Cys Thr Thr Cys Val Ala Gly Ser Thr Cys Thr Lys 
W -> 281 500 505 510 

283 ATT AAT GAC TGG TAC CAT CAG TGC CTG TAG 

284 He Asn Asp Trp Tyr His Gin Cys Leu 
W--> 285 515 520 

288 (2) INFORMATION FOR SEQ ID NO: 4: 

290 (i) SEQUENCE CHARACTERISTICS: 

291 (A) LENGTH: 297 amino acids 

292 (B) TYPE: amino acid 

293 (D) TOPOLOGY: linear 
295 (ii) MOLECULE TYPE: protein 

297 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

299 Met His Leu Ser Ala Thr Thr Gly Phe Leu Ala Leu Pro Val Leu Ala 

300 1 5 10 15 

302 Leu Asp Gin Leu Ser Gly He Gly Gin Thr Thr Arg Tyr Trp Asp Cys 

303 20 25 30 

305 Cys Lys Pro Ser Cys Ala Trp Pro Gly Lys Gly Pro Ser Ser Pro Val 

306 35 40 45 

308 Gin Ala Cys Asp Lys Asn Asp Asn Pro Leu Asn Asp Gly Gly Ser Thr 

309 50 " 55 60 

311 Arg Ser 

312 65 

314 Ser Pro 

315 85 90 



311 Arg Ser Gly Cys Asp Ala Gly Gly Ser Ala Tyr Met Cys Ser Ser Glu 

312 65 70 75 80 

314 Ser Pro Trp Ala Val Ser Asp Glu Leu Ser Tyr Gly Trp Ala Ala Val 

315 85 90 95 
317 Lys Leu Ala Gly Ser Ser Glu Ser Gin Trp Cys Cys Ala Cys Tyr 



Glu 

318 100 105 * " HO 

320 Leu Thr Phe Thr Ser Gly Pro Val Ala Gly Lys Lys Met He Val Gin 

321 H5 120 125 

323 Ala Thr Asn Thr Gly Gly Asp Leu Gly Asp Asn His Phe Asp Leu Ala 

324 130 135 140 
Gly Gly Gly 

327 145 150 155 



324 130 ±J > J 

326 lie Pro Gly Gly Gly Val Gly He Phe Asn Ala Cys Thr Asp Gin Tyr 

327 145 150 155 

329 Gly Ala Pro Pro Asn Gly Trp Gly Asp Arg Tyr Gly Gly He His Ser 

330 165 170 175 

332 Lvs Glu Glu Cys Glu Ser Phe Pro Glu Ala Leu Lys Pro Gly Cys Asn 

333 180 185 190 

335 Trp Arg Phe Asp Trp Phe Gin Asn Ala Asp Asn Pro Ser Val Thr Phe 

336 195 200 205 

338 Gin Glu Val Ala Cys Pro Ser Glu Leu Thr Ser Lys Ser Gly Cys Ser 

339 210 215 220 

341 Arg Pro Ser Ser Ser Thr Ser Ser Pro Val Asn Gin Pro Thr Ser Thr 
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L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L : 
L: 
L: 
L: 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 



33 M:220 C: 

34 M:220 C: 
213 M:336 W: 
217 M:336 W: 
221 M:336 W: 
225 M:336 W: 
229 M:336 W: 
233 M:336 W: 



237 
241 



336 W: 
;336 W: 



:245 M:336 W: 
:249 M:336 W: 
:253 M:336 W: 
:257 M:336 W: 
:261 M:336 W: 
:265 M:336 W: 
:269 M:336 W: 
:273 M:336 W: 
:277 M:336 W: 
:281 M:336 W: 
:285 M:336 W: 
:375 M:336 W: 
:379 M:336 W: 
:383 M:336 W: 
:387 M:336 W: 
:391 M:336 W: 
:395 M:336 W: 
:399 M:336 W: 
:403 M:336 W: 
:407 M:336 W: 
:411 M:336 W: 
:415 M:336 



W 

:419 M:336 W 
:423 M:336 W 
:427 M:336 W 
:431 M:336 W 
:435 M:336 W 
439 M:336 W 
443 M:336 W 
:447 M:336 W 
:451 M:336 W 
:544 M:336 W 
:548 M:336 W 
:552 M:336 W 
:556 M:336 W 
:560 M:336 W 
:564 M:336 W 
:568 M:336 W 



Keyword misspelled or invalid 
Keyword misspelled or invalid 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number 
Invalid Amino Acid Number in 
Invalid Amino Acid Number in 



in 
in 
in 



in 
in 
in 



in 
in 
in 



in 
in 
in 



format/ 

format, 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 

Coding 



[(A) APPLICATION NUMBER : ] 
[(B) FILING DATE: ] 
Region, SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 3 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 5 
SEQ ID: 7 
SEQ ID: 7 . 
SEQ ID: 7 
SEQ ID: 7 



Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 
Region, 



SEQ ID: 7 
SEQ ID: 7 
SEQ ID: 7 
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VERIFICATION SUMMARY DATE: 02/04/2002 

PATENT APPLICATION: US/10/007 f 521 TIME: 11:36:16 

Input Set : N:\Crf3\RULE60\10007521.raw 
Output Set: N:\CRF3\02042002\J007521.raw 



L 


572 M: 336 W: 


Invalid Amino 


Acid 


Number 


in 


Coding Region, 


SEQ 


ID: 


7 


L 


576 M:336 W: 


Invalid Amino 


Acid 


Number 


in 


Coding Region, 


SEQ 


ID: 


7 


L 


580 M:336 W: 


Invalid Amino 


Acid 


Number 


in 


Coding Region, 


SEQ 


ID: 


7 


L 


584 M:336 W: 


Invalid Amino 


Acid 


Number 


in 


Coding Region, 


SEQ 


ID: 


7 


L 


3513 M:341 W 


(46) 


°n B 


or 


"Xaa" 


used, 


for 


SEQ ID# 


79 








L 


3529 M:341 W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ ID# 


80 








L 


3545 M:341 W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ ID# 


81 








L 


3561 M:341 W 


(46) 


n n „ 


or 


"Xaa" 


used, 


for 


SEQ ID# 


82 








L 


3577 M:341 W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ ID# 


83 








L 


3907 M:341 W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ ID# 


105 








L 


3922 M:341 W 


(46) 




or 


"Xaa" 


used, 


for 


SEQ ID# 


106 








L 


3937 M:341 W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ ID# 


107 








L 


3952 M:341 W 


(46) 


n n n 


or 


"Xaa" 


used, 


for 


SEQ ID# 


108 








L 


3967 M:341 W 


(46) 




or 


"Xaa" 


used, 


for 


SEQ ID# 


109 
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